AITopics | ann training

Note how the most dominant accuracy flows arise fromX3, which is the only bias-free feature in the dataset. In contrast, the largest bias flows arise fromX1 and X2, both of which are heavily biased features.

artificial intelligence, dataset, machine learning, (17 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.70)

Add feedback

Software for ANN training on a Ring Array Processor

Neural Information Processing SystemsApr-6-2023, 19:28:01 GMT

Experimental research on Artificial Neural Network (ANN) algorithms requires either writing variations on the same program or making one monolithic program with many parameters and options. By using an object-oriented library, the size of these experimental programs is reduced while making them easier to read, write and modify. An efficient and flexible realization of this idea is Connection(cid:173) ist Layered Object-oriented Network Simulator (CLONES). CLONES runs on UNIX1 workstations and on the 100-1000 MFLOP Ring Array Processor (RAP) that we built with ANN algorithms in mind. In this report we describe CLONES and show how it is implemented on the RAP.

ann training, ring array processor, software, (1 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.69)

Add feedback

Controlled Descent Training

Andersson, Viktor, Varga, Balázs, Szolnoky, Vincent, Syrén, Andreas, Jörnsten, Rebecka, Kulcsár, Balázs

arXiv.org Artificial IntelligenceMar-16-2023

In this work, a novel and model-based artificial neural network (ANN) training method is developed supported by optimal control theory. The method augments training labels in order to robustly guarantee training loss convergence and improve training convergence rate. Dynamic label augmentation is proposed within the framework of gradient descent training where the convergence of training loss is controlled. First, we capture the training behavior with the help of empirical Neural Tangent Kernels (NTK) and borrow tools from systems and control theory to analyze both the local and global training dynamics (e.g. stability, reachability). Second, we propose to dynamically alter the gradient descent training mechanism via fictitious labels as control inputs and an optimal state feedback policy. In this way, we enforce locally $\mathcal{H}_2$ optimal and convergent training behavior. The novel algorithm, \textit{Controlled Descent Training} (CDT), guarantees local convergence. CDT unleashes new potentials in the analysis, interpretation, and design of ANN architectures. The applicability of the method is demonstrated on standard regression and classification problems.

artificial intelligence, machine learning, training dynamic, (18 more...)

arXiv.org Artificial Intelligence

2303.09216

Country:

North America > United States > New York (0.04)
North America > United States > Michigan (0.04)
North America > United States > Iowa (0.04)
(2 more...)

Genre: Research Report (0.64)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Gradient Descent (0.55)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.46)
Information Technology > Artificial Intelligence > Machine Learning > Statistical Learning > Regression (0.34)

Add feedback

A Time-to-first-spike Coding and Conversion Aware Training for Energy-Efficient Deep Spiking Neural Network Processor Design

Lew, Dongwoo, Lee, Kyungchul, Park, Jongsun

arXiv.org Artificial IntelligenceAug-8-2022

In this paper, we present an energy-efficient SNN architecture, which can seamlessly run deep spiking neural networks (SNNs) with improved accuracy. First, we propose a conversion aware training (CAT) to reduce ANN-to-SNN conversion loss without hardware implementation overhead. In the proposed CAT, the activation function developed for simulating SNN during ANN training, is efficiently exploited to reduce the data representation error after conversion. Based on the CAT technique, we also present a time-to-first-spike coding that allows lightweight logarithmic computation by utilizing spike time information. The SNN processor design that supports the proposed techniques has been implemented using 28nm CMOS process. The processor achieves the top-1 accuracies of 91.7%, 67.9% and 57.4% with inference energy of 486.7uJ, 503.6uJ, and 1426uJ to process CIFAR-10, CIFAR-100, and Tiny-ImageNet, respectively, when running VGG-16 with 5bit logarithmic weights.

accuracy, activation function, spike, (15 more...)

arXiv.org Artificial Intelligence

doi: 10.1145/3489517.3530457

2208.04494

Country: Asia > South Korea > Seoul > Seoul (0.04)

Genre: Research Report (0.82)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Software for ANN training on a Ring Array Processor

Kohn, Phil, Bilmes, Jeff, Morgan, Nelson, Beck, James

Neural Information Processing SystemsDec-31-1992

Experimental research on Artificial Neural Network (ANN) algorithms requires either writing variations on the same program or making one monolithic program with many parameters and options. By using an object-oriented library, the size of these experimental programs is reduced while making them easier to read, write and modify. An efficient and flexible realization of this idea is Connectionist Layered Object-oriented Network Simulator (CLONES).

international computer science institute, processor, ring array processor, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > California > San Mateo County > San Mateo (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Software for ANN training on a Ring Array Processor

Kohn, Phil, Bilmes, Jeff, Morgan, Nelson, Beck, James

Neural Information Processing SystemsDec-31-1992

Experimental research on Artificial Neural Network (ANN) algorithms requires either writing variations on the same program or making one monolithic program with many parameters and options. By using an object-oriented library, the size of these experimental programs is reduced while making them easier to read, write and modify. An efficient and flexible realization of this idea is Connectionist Layered Object-oriented Network Simulator (CLONES).

international computer science institute, processor, ring array processor, (12 more...)

Neural Information Processing Systems

Country:

North America > United States > California > San Mateo County > San Mateo (0.04)
North America > United States > California > Alameda County > Berkeley (0.04)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Object-Oriented Architecture (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)

Add feedback

Software for ANN training on a Ring Array Processor

Kohn, Phil, Bilmes, Jeff, Morgan, Nelson, Beck, James

Neural Information Processing SystemsDec-31-1992

Experimental research on Artificial Neural Network (ANN) algorithms requires either writing variations on the same program or making one monolithic program with many parameters and options. By using an object-oriented library, the size of these experimental programs is reduced while making them easier to read, write and modify. An efficient and flexible realization of this idea is Connectionist LayeredObject-oriented Network Simulator (CLONES).

artificial intelligence, machine learning, object-oriented architecture, (15 more...)

Neural Information Processing Systems

Country: North America > United States (0.28)

Technology: